Review




Structured Review

Mestrelab Research data processing workflow
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Data Processing Workflow, supplied by Mestrelab Research, used in various techniques. Bioz Stars score: 86/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/data processing workflow/product/Mestrelab Research
Average 86 stars, based on 1 article reviews
data processing workflow - by Bioz Stars, 2026-06
86/100 stars

Images

1) Product Images from "Accelerating natural product discovery with linked MS-genomics and language/transformer-based models"

Article Title: Accelerating natural product discovery with linked MS-genomics and language/transformer-based models

Journal: npj Antimicrobials and Resistance

doi: 10.1038/s44259-026-00206-7

Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed Workflow for Intelligent Structural Elucidation (WISE).
Figure Legend Snippet: Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed Workflow for Intelligent Structural Elucidation (WISE).

Techniques Used: Biomarker Discovery

A Overview of the cheminformatics analysis workflow (WISE) to annotate individual molecules from raw liquid chromatography-tandem mass spectral (LC-MS/MS) data of fermentation extracts. B Distribution of WISE scores for correct structures (Real) versus isobaric decoy structures (within 0.005 m/z) with different molecular formulae (Decoy) from the Critical Assessment for Small Molecule Identification (CASMI2022) dataset. Low and high WISE score thresholds of 0.35 and 0.65 respectively are annotated for reference. C Receiver Operating Characteristic (ROC) curve demonstrating the discrimination of correct structures from decoys with differing molecular formulae within a 0.005 m/z window. AUC = Area Under Curve. CI = Confidence Interval. Detailed data in Table .
Figure Legend Snippet: A Overview of the cheminformatics analysis workflow (WISE) to annotate individual molecules from raw liquid chromatography-tandem mass spectral (LC-MS/MS) data of fermentation extracts. B Distribution of WISE scores for correct structures (Real) versus isobaric decoy structures (within 0.005 m/z) with different molecular formulae (Decoy) from the Critical Assessment for Small Molecule Identification (CASMI2022) dataset. Low and high WISE score thresholds of 0.35 and 0.65 respectively are annotated for reference. C Receiver Operating Characteristic (ROC) curve demonstrating the discrimination of correct structures from decoys with differing molecular formulae within a 0.005 m/z window. AUC = Area Under Curve. CI = Confidence Interval. Detailed data in Table .

Techniques Used: Liquid Chromatography, Liquid Chromatography with Mass Spectroscopy

A Elucidated biosynthetic gene cluster BGC0000709, responsible for neomycin production. B Similarity (%) of each embedded protein and sequence similarity (%) of the top-scoring protein matches. C Predicted protein structures of the top matches for Fe-S oxidoreductase ( CAF33317 ) from neomycin BGC. D Heatmap of WISE scores for strain-media combinations derived from the liquid chromatography-tandem mass spectrometry (LC-MS/MS) data of fermentation extracts. Only strains with at least 1 media combination with non-zero WISE score are shown. Activated strains (if any) are referred to by their parent strain instead of individually (details of the activated strains are in Table ). E Structure of neomycin B. F Workflow of the co-occurrence strategy and matrix ranking table with 0.35 WISE score and 0.75 PLM score cutoff. Strains that have been cross validated are highlighted in green or yellow, indicating whether these compounds were present (green) or absent (yellow). Lines indicating higher confidence thresholds of 0.65 WISE score (in red) and 2.5 PLM score (in blue) are also included.
Figure Legend Snippet: A Elucidated biosynthetic gene cluster BGC0000709, responsible for neomycin production. B Similarity (%) of each embedded protein and sequence similarity (%) of the top-scoring protein matches. C Predicted protein structures of the top matches for Fe-S oxidoreductase ( CAF33317 ) from neomycin BGC. D Heatmap of WISE scores for strain-media combinations derived from the liquid chromatography-tandem mass spectrometry (LC-MS/MS) data of fermentation extracts. Only strains with at least 1 media combination with non-zero WISE score are shown. Activated strains (if any) are referred to by their parent strain instead of individually (details of the activated strains are in Table ). E Structure of neomycin B. F Workflow of the co-occurrence strategy and matrix ranking table with 0.35 WISE score and 0.75 PLM score cutoff. Strains that have been cross validated are highlighted in green or yellow, indicating whether these compounds were present (green) or absent (yellow). Lines indicating higher confidence thresholds of 0.65 WISE score (in red) and 2.5 PLM score (in blue) are also included.

Techniques Used: Sequencing, Derivative Assay, Liquid Chromatography, Mass Spectrometry, Liquid Chromatography with Mass Spectroscopy



Similar Products

99
Thermo Fisher cryoem data processing workflow
Cryoem Data Processing Workflow, supplied by Thermo Fisher, used in various techniques. Bioz Stars score: 99/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/cryoem data processing workflow/product/Thermo Fisher
Average 99 stars, based on 1 article reviews
cryoem data processing workflow - by Bioz Stars, 2026-06
99/100 stars
  Buy from Supplier

86
Mestrelab Research data processing workflow
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Data Processing Workflow, supplied by Mestrelab Research, used in various techniques. Bioz Stars score: 86/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/data processing workflow/product/Mestrelab Research
Average 86 stars, based on 1 article reviews
data processing workflow - by Bioz Stars, 2026-06
86/100 stars
  Buy from Supplier

86
Unifi Inc data processing workflows
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Data Processing Workflows, supplied by Unifi Inc, used in various techniques. Bioz Stars score: 86/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/data processing workflows/product/Unifi Inc
Average 86 stars, based on 1 article reviews
data processing workflows - by Bioz Stars, 2026-06
86/100 stars
  Buy from Supplier

90
KNIME GmbH data processing workflow
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Data Processing Workflow, supplied by KNIME GmbH, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/data processing workflow/product/KNIME GmbH
Average 90 stars, based on 1 article reviews
data processing workflow - by Bioz Stars, 2026-06
90/100 stars
  Buy from Supplier

90
Illumina Inc workflow for processing illumina deep sequencing chip-seq data
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Workflow For Processing Illumina Deep Sequencing Chip Seq Data, supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/workflow for processing illumina deep sequencing chip-seq data/product/Illumina Inc
Average 90 stars, based on 1 article reviews
workflow for processing illumina deep sequencing chip-seq data - by Bioz Stars, 2026-06
90/100 stars
  Buy from Supplier

90
Unifi Inc data processing workflow
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Data Processing Workflow, supplied by Unifi Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/data processing workflow/product/Unifi Inc
Average 90 stars, based on 1 article reviews
data processing workflow - by Bioz Stars, 2026-06
90/100 stars
  Buy from Supplier

90
MathWorks Inc custom automated experimental design and data processing workflow
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Custom Automated Experimental Design And Data Processing Workflow, supplied by MathWorks Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/custom automated experimental design and data processing workflow/product/MathWorks Inc
Average 90 stars, based on 1 article reviews
custom automated experimental design and data processing workflow - by Bioz Stars, 2026-06
90/100 stars
  Buy from Supplier

90
DataLad Project datalad-based fairly big workflow for image data processing
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Datalad Based Fairly Big Workflow For Image Data Processing, supplied by DataLad Project, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/datalad-based fairly big workflow for image data processing/product/DataLad Project
Average 90 stars, based on 1 article reviews
datalad-based fairly big workflow for image data processing - by Bioz Stars, 2026-06
90/100 stars
  Buy from Supplier

99
Thermo Fisher cryo em data processing workflow
Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed <t>Workflow</t> <t>for</t> <t>Intelligent</t> Structural Elucidation (WISE).
Cryo Em Data Processing Workflow, supplied by Thermo Fisher, used in various techniques. Bioz Stars score: 99/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/cryo em data processing workflow/product/Thermo Fisher
Average 99 stars, based on 1 article reviews
cryo em data processing workflow - by Bioz Stars, 2026-06
99/100 stars
  Buy from Supplier

Image Search Results


Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed Workflow for Intelligent Structural Elucidation (WISE).

Journal: npj Antimicrobials and Resistance

Article Title: Accelerating natural product discovery with linked MS-genomics and language/transformer-based models

doi: 10.1038/s44259-026-00206-7

Figure Lengend Snippet: Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed Workflow for Intelligent Structural Elucidation (WISE).

Article Snippet: The data processing Workflow for Intelligent Structural Elucidation (WISE) is partially based on Mestrenova v15.0.1-35756 which is commercially available from Mestrelab Research S. L. ( www.mestrelab.com ).

Techniques: Biomarker Discovery

A Overview of the cheminformatics analysis workflow (WISE) to annotate individual molecules from raw liquid chromatography-tandem mass spectral (LC-MS/MS) data of fermentation extracts. B Distribution of WISE scores for correct structures (Real) versus isobaric decoy structures (within 0.005 m/z) with different molecular formulae (Decoy) from the Critical Assessment for Small Molecule Identification (CASMI2022) dataset. Low and high WISE score thresholds of 0.35 and 0.65 respectively are annotated for reference. C Receiver Operating Characteristic (ROC) curve demonstrating the discrimination of correct structures from decoys with differing molecular formulae within a 0.005 m/z window. AUC = Area Under Curve. CI = Confidence Interval. Detailed data in Table .

Journal: npj Antimicrobials and Resistance

Article Title: Accelerating natural product discovery with linked MS-genomics and language/transformer-based models

doi: 10.1038/s44259-026-00206-7

Figure Lengend Snippet: A Overview of the cheminformatics analysis workflow (WISE) to annotate individual molecules from raw liquid chromatography-tandem mass spectral (LC-MS/MS) data of fermentation extracts. B Distribution of WISE scores for correct structures (Real) versus isobaric decoy structures (within 0.005 m/z) with different molecular formulae (Decoy) from the Critical Assessment for Small Molecule Identification (CASMI2022) dataset. Low and high WISE score thresholds of 0.35 and 0.65 respectively are annotated for reference. C Receiver Operating Characteristic (ROC) curve demonstrating the discrimination of correct structures from decoys with differing molecular formulae within a 0.005 m/z window. AUC = Area Under Curve. CI = Confidence Interval. Detailed data in Table .

Article Snippet: The data processing Workflow for Intelligent Structural Elucidation (WISE) is partially based on Mestrenova v15.0.1-35756 which is commercially available from Mestrelab Research S. L. ( www.mestrelab.com ).

Techniques: Liquid Chromatography, Liquid Chromatography with Mass Spectroscopy

A Elucidated biosynthetic gene cluster BGC0000709, responsible for neomycin production. B Similarity (%) of each embedded protein and sequence similarity (%) of the top-scoring protein matches. C Predicted protein structures of the top matches for Fe-S oxidoreductase ( CAF33317 ) from neomycin BGC. D Heatmap of WISE scores for strain-media combinations derived from the liquid chromatography-tandem mass spectrometry (LC-MS/MS) data of fermentation extracts. Only strains with at least 1 media combination with non-zero WISE score are shown. Activated strains (if any) are referred to by their parent strain instead of individually (details of the activated strains are in Table ). E Structure of neomycin B. F Workflow of the co-occurrence strategy and matrix ranking table with 0.35 WISE score and 0.75 PLM score cutoff. Strains that have been cross validated are highlighted in green or yellow, indicating whether these compounds were present (green) or absent (yellow). Lines indicating higher confidence thresholds of 0.65 WISE score (in red) and 2.5 PLM score (in blue) are also included.

Journal: npj Antimicrobials and Resistance

Article Title: Accelerating natural product discovery with linked MS-genomics and language/transformer-based models

doi: 10.1038/s44259-026-00206-7

Figure Lengend Snippet: A Elucidated biosynthetic gene cluster BGC0000709, responsible for neomycin production. B Similarity (%) of each embedded protein and sequence similarity (%) of the top-scoring protein matches. C Predicted protein structures of the top matches for Fe-S oxidoreductase ( CAF33317 ) from neomycin BGC. D Heatmap of WISE scores for strain-media combinations derived from the liquid chromatography-tandem mass spectrometry (LC-MS/MS) data of fermentation extracts. Only strains with at least 1 media combination with non-zero WISE score are shown. Activated strains (if any) are referred to by their parent strain instead of individually (details of the activated strains are in Table ). E Structure of neomycin B. F Workflow of the co-occurrence strategy and matrix ranking table with 0.35 WISE score and 0.75 PLM score cutoff. Strains that have been cross validated are highlighted in green or yellow, indicating whether these compounds were present (green) or absent (yellow). Lines indicating higher confidence thresholds of 0.65 WISE score (in red) and 2.5 PLM score (in blue) are also included.

Article Snippet: The data processing Workflow for Intelligent Structural Elucidation (WISE) is partially based on Mestrenova v15.0.1-35756 which is commercially available from Mestrelab Research S. L. ( www.mestrelab.com ).

Techniques: Sequencing, Derivative Assay, Liquid Chromatography, Mass Spectrometry, Liquid Chromatography with Mass Spectroscopy